Analysis of Correlation Based Dimension Reduction Methods

نویسندگان

  • Yong Joon Shin
  • Cheong Hee Park
چکیده

Dimension reduction is an important topic in data mining and machine learning. Especially dimension reduction combined with feature fusion is an effective preprocessing step when the data are described by multiple feature sets. Canonical Correlation Analysis (CCA) and Discriminative Canonical Correlation Analysis (DCCA) are feature fusion methods based on correlation. However, they are different in that DCCA is a supervised method utilizing class label information, while CCA is an unsupervised method. It has been shown that the classification performance of DCCA is superior to that of CCA due to the discriminative power using class label information. On the other hand, Linear Discriminant Analysis (LDA) is a supervised dimension reduction method and it is known as a special case of CCA. In this paper, we analyze the relationship between DCCA and LDA, showing that the projective directions by DCCA are equal to the ones obtained from LDA with respect to an orthogonal transformation. Using the relation with LDA, we propose a new method that can enhance the performance of DCCA. The experimental results show that the proposed method exhibits better classification performance than the original DCCA.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multiple Sensor Image Registration, Image Fusion and Dimension Reduction of Earth Science Imagery

The goal of our project is to develop and evaluate image analysis methodologies for use on the ground or on-board spacecraft, particularly spacecraft constellations. Our focus is on developing methods to perform automatic registration and fusion of multisensor data representing multiple spatial, spectral and temporal resolutions, as well as dimension reduction of hyperspectral data. Feature ext...

متن کامل

Nearest neighbors and correlation dimension for dimensionality estimation. Application to factor analysis of real biological time series data

Determining the number of components in dimensionality reduction techniques is still one of the open problems of research on data analysis. These methods are often used in knowledge extraction of multivariate great dimensional data, but very often the number of components is assumed to be known. One of the classical methods to estimate this dimensionality is based on the Principal Components An...

متن کامل

Analysis of Censored Survival Data with Dimension Reduction Methods‎: Tehran Lipid and Glucose Study

 ‎Cardiovascular diseases (CVDs) are the leading cause of death worldwide‎. ‎To specify an appropriate model to determine the risk of CVD and predict survival rate‎, ‎users are required to specify a functional form which relates the outcome variables to the input ones‎. ‎In this paper‎, ‎we proposed a dimension reduction method using a general model‎, ‎which includes many widely used survival m...

متن کامل

Feature Extraction and Efficiency Comparison Using Dimension Reduction Methods in Sentiment Analysis Context

Nowadays, users can share their ideas and opinions with widespread access to the Internet and especially social networks. On the other hand, the analysis of people's feelings and ideas can play a significant role in the decision making of organizations and producers. Hence, sentiment analysis or opinion mining is an important field in natural language processing. One of the most common ways to ...

متن کامل

A Novel Dimension Reduction Technique based on Correlation Coefficient

In this paper, a novel simple dimension reduction technique for classification is proposed based on correlation coefficient. Existing dimension reduction techniques like LDA is known for capturing the most discriminant features of the data in the projected space while PCA is known for preservin g the most descriptive ones after projection. Our novel technique integrates correlation coefficient ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Applied Mathematics and Computer Science

دوره 21  شماره 

صفحات  -

تاریخ انتشار 2011